agent character
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Ulmer, Dennis, Mansimov, Elman, Lin, Kaixiang, Sun, Justin, Gao, Xibin, Zhang, Yi
Large language models (LLMs) are powerful dialogue agents, but specializing them towards fulfilling a specific function can be challenging. Instructing tuning, i.e. tuning models on instruction and sample responses generated by humans (Ouyang et al., 2022), has proven as an effective method to do so, yet requires a number of data samples that a) might not be available or b) costly to generate. Furthermore, this cost increases when the goal is to make the LLM follow a specific workflow within a dialogue instead of single instructions. Inspired by the self-play technique in reinforcement learning and the use of LLMs to simulate human agents, we propose a more effective method for data collection through LLMs engaging in a conversation in various roles. This approach generates a training data via "self-talk" of LLMs that can be refined and utilized for supervised fine-tuning. We introduce an automated way to measure the (partial) success of a dialogue. This metric is used to filter the generated conversational data that is fed back in LLM for training. Based on our automated and human evaluations of conversation quality, we demonstrate that such self-talk data improves results. In addition, we examine the various characteristics that showcase the quality of generated dialogues and how they can be connected to their potential utility as training data.
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Oceania > Australia > Victoria > Melbourne (0.04)
- (8 more...)
- Research Report (1.00)
- Workflow (0.69)
- Leisure & Entertainment > Games (1.00)
- Information Technology (0.93)
Improving the CSIEC Project and Adapting It to the English Teaching and Learning in China
Jia, Jiyou, Hou, Shufen, Chen, Weichao
In this paper after short review of the CSIEC project initialized by us in 2003 we present the continuing development and improvement of the CSIEC project in details, including the design of five new Microsoft agent characters representing different virtual chatting partners and the limitation of simulated dialogs in specific practical scenarios like graduate job application interview, then briefly analyze the actual conditions and features of its application field: web-based Englis h education in China. Finally we introduce our effort s to adapt this system to the requirements of English te aching and learning in China and point out the work next to do.
- North America > United States (0.05)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Asia > China > Beijing > Beijing (0.04)
- Personal > Interview (0.46)
- Research Report (0.40)
- Education > Educational Setting (1.00)
- Education > Curriculum > Subject-Specific Education (0.70)